Search CORE

29 research outputs found

Precedence-constrained scheduling problems parameterized by partial order width

Author: HL Bodlaender
J Du
J Ullman
JK Lenstra
M Mnich
MK Warmuth
MR Garey
R Bevern van
Romeo Rizzi
S Felsner
SB Akers Jr
VV Servakh
WW Hardgrave
Publication venue
Publication date: 01/01/2016
Field of study

Negatively answering a question posed by Mnich and Wiese (Math. Program. 154(1-2):533-562), we show that P2|prec,

p_j{\in}\{1,2\}

C_{\max}

, the problem of finding a non-preemptive minimum-makespan schedule for precedence-constrained jobs of lengths 1 and 2 on two parallel identical machines, is W[2]-hard parameterized by the width of the partial order giving the precedence constraints. To this end, we show that Shuffle Product, the problem of deciding whether a given word can be obtained by interleaving the letters of

k

other given words, is W[2]-hard parameterized by

k

, thus additionally answering a question posed by Rizzi and Vialette (CSR 2013). Finally, refining a geometric algorithm due to Servakh (Diskretn. Anal. Issled. Oper. 7(1):75-82), we show that the more general Resource-Constrained Project Scheduling problem is fixed-parameter tractable parameterized by the partial order width combined with the maximum allowed difference between the earliest possible and factual starting time of a job.Comment: 14 pages plus appendi

arXiv.org e-Print Archive

Crossref

HAL Descartes

Publikationsserver der RWTH Aachen University

Hal-Diderot

HAL-Ecole des Ponts ParisTech

HAL - UPEC / UPEM

Looking at Vector Space and Language Models for IR using Density Matrices

Author: A Gleason
AI Lvovsky
B Piwowarski
C Carpineto
ChX Zhai
G Birkhoff
G Salton
G Zuccon
G Zuccon
J Rocchio
J Zobel
K Rijsbergen van
K Tsuda
M Melucci
M Melucci
M Melucci
M Melucci
MA Nielsen
MK Warmuth
S Deerwester
SKM Wong
T Hofmann
X Zhao
Publication venue
Publication date: 08/01/2014
Field of study

In this work, we conduct a joint analysis of both Vector Space and Language Models for IR using the mathematical framework of Quantum Theory. We shed light on how both models allocate the space of density matrices. A density matrix is shown to be a general representational tool capable of leveraging capabilities of both VSM and LM representations thus paving the way for a new generation of retrieval models. We analyze the possible implications suggested by our findings.Comment: In Proceedings of Quantum Interaction 201

arXiv.org e-Print Archive

Crossref

Predicting sample size required for classification performance

Author: A Vlachos
AH Briggs
AV Carneiro
C Cortes
CJ Adcock
F Olsson
F Provost
HM Kalayeh
I Scheinin
J Algina
J Cai
J Cohen
J Eng
J Yuan
J Zhu
JE Dennis
K Brinker
K Dobbin
K Fukunaga
K Nigam
KR Hess
LE Yelle
Long H Ngo
M Last
M Li
MK Warmuth
MR Jiroutek
N Boonyanunta
Qing Zeng-Treitler
RL Figueroa
Rosa L Figueroa
RV Lenth
S Kandula
S Mukherjee
S Tong
S-Y Kim
Sasikiran Kandula
SE Maxwell
SJ Walters
SL Beal
V Stalbovskaya
VH Tam
Y Liu
Publication venue: BioMed Central
Publication date: 01/01/2012
Field of study

Abstract Background Supervised learning methods need annotated data in order to generate efficient models. Annotated data, however, is a relatively scarce resource and can be expensive to obtain. For both passive and active learning methods, there is a need to estimate the size of the annotated sample required to reach a performance target. Methods We designed and implemented a method that fits an inverse power law model to points of a given learning curve created using a small annotated training set. Fitting is carried out using nonlinear weighted least squares optimization. The fitted model is then used to predict the classifier's performance and confidence interval for larger sample sizes. For evaluation, the nonlinear weighted curve fitting method was applied to a set of learning curves generated using clinical text and waveform classification tasks with active and passive sampling methods, and predictions were validated using standard goodness of fit measures. As control we used an un-weighted fitting method. Results A total of 568 models were fitted and the model predictions were compared with the observed performances. Depending on the data set and sampling method, it took between 80 to 560 annotated samples to achieve mean average and root mean squared error below 0.01. Results also show that our weighted fitting method outperformed the baseline un-weighted method (p < 0.05). Conclusions This paper describes a simple and effective sample size prediction algorithm that conducts weighted fitting of learning curves. The algorithm outperformed an un-weighted algorithm described in previous literature. It can help researchers determine annotation sample size for supervised machine learning.</p

Crossref

Harvard University - DASH

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

In silico approach to screen compounds active against parasitic nematodes of major socio-economic importance

Author: A Harder
A Harder
A Tropsha
AJ Bokisch
AM Mayer
AM Mayer
AM Mayer
C Cortes
CE James
CY Liew
D Dutta
D Wishart
D Woods
DF Cully
E Byvatov
E Lacey
E Marchiori
GW Bemis
H Geppert
IA Sutherland
J Keiser
J Keiser
J Overington
L Holden-Dye
L Holden-Dye
MK Warmuth
MWB Trotter
O Ivanciuc
P Kohler
PA Friedman
R Burbidge
R Kaminsky
RF Freitas
RI Jennrich
RJ Martin
RN Jorissen
S Geerts
S Ranganathan
S Reddy
S-H Xiao
Shoba Ranganathan
Sr Sousa
Varun Khanna
VV Zernov
W Duch
Y Hu
Y Marrero-Ponce
Y Wang
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Infections due to parasitic nematodes are common causes of morbidity and fatality around the world especially in developing nations. At present however, there are only three major classes of drugs for treating human nematode infections. Additionally the scientific knowledge on the mechanism of action and the reason for the resistance to these drugs is poorly understood. Commercial incentives to design drugs that are endemic to developing countries are limited therefore, virtual screening in academic settings can play a vital role is discovering novel drugs useful against neglected diseases. In this study we propose to build robust machine learning model to classify and screen compounds active against parasitic nematodes.A set of compounds active against parasitic nematodes were collated from various literature sources including PubChem while the inactive set was derived from DrugBank database. The support vector machine (SVM) algorithm was used for model development, and stratified ten-fold cross validation was used to evaluate the performance of each classifier. The best results were obtained using the radial basis function kernel. The SVM method achieved an accuracy of 81.79% on an independent test set. Using the model developed above, we were able to indentify novel compounds with potential anthelmintic activity.In this study, we successfully present the SVM approach for predicting compounds active against parasitic nematodes which suggests the effectiveness of computational approaches for antiparasitic drug discovery. Although, the accuracy obtained is lower than the previously reported in a similar study but we believe that our model is more robust because we intentionally employed stringent criteria to select inactive dataset thus making it difficult for the model to classify compounds. The method presents an alternative approach to the existing traditional methods and may be useful for predicting hitherto novel anthelmintic compounds.12 page(s

Crossref

Springer - Publisher Connector

PubMed Central

Macquarie University ResearchOnline

ScholarBank@NUS

HOG Based Radial Basis Function Network for Brain MR Image Classification

Author: F Segonne
FJ Galdames
GD Lowe
H Abdi
J Jiang
J Park
MK Warmuth
PD Sathyaa
S Chen
SR Kannan
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Unshuffling Permutations

Author: A Mansfield
C Choffrut
G Duchamp
J Leeuwen van
J-C Spehner
MK Warmuth
P Bose
R Rizzi
R Simion
S Buss
S Eilenberg
SA Joni
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 03/03/2016
Field of study

International audienceA permutation is said to be a square if it can be obtained by shuffling two order-isomorphic patterns. The definition is intended to be the natural counterpart to the ordinary shuffle of words and languages. In this paper, we tackle the problem of recognizing square permutations from both the point of view of algebra and algorithms. On the one hand, we present some algebraic and combinatorial properties of the shuffle product of permutations. We follow an unusual line consisting in defining the shuffle of permutations by means of an unshuffling operator, known as a coproduct. This strategy allows to obtain easy proofs for algebraic and combinatorial properties of our shuffle product. We besides exhibit a bijection between square (213, 231)-avoiding permutations and square binary words. On the other hand, by using a pattern avoidance criterion on oriented perfect matchings, we prove that recognizing square permutations is NP-complete

arXiv.org e-Print Archive

Crossref

HAL Descartes

Hal-Diderot

HAL-Ecole des Ponts ParisTech

HAL - UPEC / UPEM

Prediction of drug solubility on parallel computing architecture by support vector machines

Author: A Berl
C Cortes
C-C Chang
CL Blake
DE Lee
DS Cao
JH Voigt
JM Kriegl
K Brudzewski
K Hornik
MH Fatemi
MK Warmuth
O Ivanciuc
P Rajendra
RN Jorissen
X Fan
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref